Entry Name: CRPGL-BROEKSEMA-MC1

VAST Challenge 2014
Mini-Challenge 1

Team Members:

We're both research scientists at the Centre de Recherche Public – Gabriel Lippmann (CRPGL) in Luxembourg.

Student Team:

No

Analytic Tools Used:

Note: As we had very limited time, we used out of the box tools for the analysis. One of the things we found was that the data was so messy, that most of the time was spent on reshaping it in a more usable format. During this process, we mostly wrote a number of R scripts that performed data cleansing, merging and extraction. A couple of helper function and the interactive interface of RStudio were used to make queries on the extracted tables to follow leads. As such, it can be questioned if we used “visual” analytics.

On the other hand, we questioned ourselves, about the kind of tools that would be needed to do the cleansing and the analysis and would provide an interactive visual interface. Actually, this tool shouldn't help so much with the analysis as well with the cleansing. Moreover, this tool should be easy enough to use under time pressure (i.e. it should take very little time to create various interactive visualizations for graph data and multivariate data), while on the other hand providing enough flexibility to handle highly unstructured data. The exercise gave us much inspiration for the design of future visual analytic techniques to tackle problems with data as complex and messy as the data in this challenge.

Approximately how many hours were spent working on this submission in total?

100 hours - Approximately one week full time by both team members

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2014 is complete?

Yes

Video:

CRPGL-BROEKSEMA-MC1

Questions

MC1.1Provide a visual representation of the structure of the Protectors of Kronos network, with supporting evidence.

Provide novel visualizations appropriate for communicating key information to the busy leaders of the investigation. Please limit your response to no more than eight images and 500 words.

a.      Who are the leaders?

One of the tools we used to find out about Elian's death is a simple bar plot. We first subsetted the articles from the start of 2009 to the end of 2013. Next we created a bar plot for the number of messages found per month. We see some obvious peek, for example at about mid 2012. Reading the articles in this time span, discloses that there is a meeting to remember the dead of Elian Karel, which in turn triggered us to search for the messages mentioning his dead (which explain the peek in early 2009).



b.     Who is part of the extended network?

c.      How has the group structure and organization changed over time?

We used Gephi to visualize the named entity graph extracted by OpenCalais. To find entities of interest and identify common mis-spellings calling for further data cleansing, we used Gephi's interactive functions such as node/edge filtering and merging. The example image below shows the largest connected components extracted from the 5 year report historical document. Nodes are colored based on Louvain clustering. Entities and their relationships were extracted on a per paragraph basis.

d.     Where are the potential connections between the POK and GAStech?

MC1.2Describe the events of January 20-21, 2014. What is the timeline of events? Please limit your response to no more than ten images and 500 words.

We used Gephi to visualize the named entity graph extracted by OpenCalais. To find entities of interest and identify common mis-spellings calling for further data cleansing, we used Gephi's interactive functions such as node/edge filtering and merging. The example image below shows all article and entities extracted from articles written on march 20th. Nodes are colored by cluster (Louvain clustering) and sized based on betweenness centrality.

### January 20

### January 21

MC1.3Identify at least two possible explanations why the GAStech employees may be missing. What evidence do you have to support each of these explanations? Please limit your response to no more than three additional images and 200 words.

  1. They were taken by the protectors of Kronos. The GAStech meeting was infiltrated by POK members posing as caterers. POK members who worked for GAStech were able to organise this as they were involved in the organization of the catering for the event. These employees may have been involved with the Asterian people’s army, a paramilitary organization involved in the drugs trade as indicated by the infection of the GAStech computers with a virus that produces Asterian People's Army themed spam (using their magazine title “ARISE”).

  2. The employees have been kidnapped, but not by the POK. The Asterian Peoples Army (APA) traffics drugs and Carmine Bodr0gi sold drugs for them and was arrested. His relative Loreto Bodrogi, may have been susceptible to compromising himself and his colleagues due to criminal pressure (due to the arrest), and supplied the APA with the necessary information to organise the kidnapping as well as access to GAStech computer system. The computers of the GAStech employee who appear to have been sending pro Kronos defence messages, have been deliberately infected with a virus to make them look like POK sympathizers a week before the kidnap. The motive is purely ransom for profit.